An outlier mining algorithm based on constrained concept lattice

نویسندگان

  • Jifu Zhang
  • Sulan Zhang
  • Kai-Hsiung Chang
  • Xiao Qin
چکیده

An Outlier Mining Algorithm Based on Constrained Concept Lattice Jifu Zhang , Sulan Zhang , Kai H. Chang b, and Xiao Qin a School of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan, P. R. China 030024 b Department of Computer Science and Software Engineering, Auburn University, Auburn, AL, USA 36849-5347 [email protected] Abstract: Traditional outlier mining methods identify outliers from a global point of view. These methods are inefficient to find locally-biased data points (outliers) in low dimensional subspaces. Constrained concept lattices can be used as an effective formal tool for data analysis because constrained concept lattices have the characteristics of high constructing efficiency, practicability, and pertinency,. In this paper,we propose an outlier mining algorithm that by treats the intent of any constrained concept lattice node as a subspace. We introduce sparsity and density coefficientsto measure outliers in low dimensional subspaces. The intent of any constrained concept lattice node is regarded as a subspace,and sparsity subspaces are searched by traversing the constrained concept lattice according to a sparsity coefficient threshold. If the intent of any father node of the sparsity subspace is a density subspace according to a density coefficient threshold, then objects contained in the extent of the sparsity subspace node are considered as bias data points or outliers. Our experimental results show that the proposed algorithm performs very well for high red-shift spectral data sets.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Outlier-based Data Association: Combining OLAP and Data Mining

Both data mining and OLAP are powerful decision support tools. However, people use them separately for years: OLAP systems concentrate on the efficiency of building OLAP cubes, and no statistical / data mining algorithms have been applied; on the other hand, statistical analysis are traditionally developed for two-way relational databases, and have not been generalized to the multi-dimensional ...

متن کامل

A Classification Rule Acquisition Algorithm Based on Constrained Concept Lattice

Concept lattice is an effective tool for data analysis. Constrained concept lattice, with the characteristics of higher constructing efficiency, practicability and pertinence, is a new concept lattice structure. For classification rule acquisition, a classification rule acquisition algorithm based on the constrained concept lattice is presented by using the concept of partition support accordin...

متن کامل

A Frequent Pattern Mining Algorithm Based on Concept Lattice

The concept lattice is an effective tool for data analysis and rule extraction, it is often well to mine frequent patterns by making use of concept lattice. In this paper, a frequent itemset mining algorithm FPCL based on concept lattice which builds lattice in batches, the algorithm builds lattice down layer by layer through the layer concept nodes and temporary nodes based on hierarchical con...

متن کامل

RODHA: Robust Outlier Detection using Hybrid Approach

The task of outlier detection is to find the small groups of data objects that are exceptional to the inherent behavior of the rest of the data. Detection of such outliers is fundamental to a variety of database and analytic tasks such as fraud detection and customer migration. There are several approaches[10] of outlier detection employed in many study areas amongst which distance based and de...

متن کامل

Efficient Mining of Association Rulesusing Closed

| Discovering association rules is one of the most important task in data mining. Many eecient algorithms have been proposed in the literature. The most noticeable are Apriori, Mannila's algorithm, Partition, Sampling and DIC, that are all based on the Apriori mining method: pruning the subset lattice (itemset lattice). In this paper we propose an eecient algorithm, called Close, based on a new...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Int. J. Systems Science

دوره 45  شماره 

صفحات  -

تاریخ انتشار 2014